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ABSTRACT 

This application note will discuss techniques used to opti- 
mize high speed Phase Locked Loop (PLL) clock distribu- 
tion networks found in PCs and High end workstations. Stat- 
ic delay along clock paths can vary considerably. By placing 
a precision delay element in the feedback signal path of the 
PLL, the system designer has a method of controlling the 
output edge placement of the PLL. Some of the parameters 
that can affect skew and phase error are feedback length, 
input edge rate, loading and temperature. Data will be pre- 
sented to show how skew and phase error were optimized 
for a specific customer application where National's 
CGS701A was used as a PLL clock distribution driver. 

WHAT IS A PLL? 

A basic Phase Locked Loop (PLL) consists of a Phase De- 
tector (PD), Low Pass Filter (LPF), Charge Pump (CP) and 
Voltage Controlled Oscillator (VCO). When put together, 
these blocks make up a control loop as shown in Figure 1. 
When the PLL is out of lock, the PD changes the frequency 
of the VCO in such a way as to minimize the phase differ- 
ence between the input signal (Vr^p) and the VCO feed- 
back signal (Vpbk). The error signal from the PD, in the form 
of an UP/DOWN signal, is proportional to the phase differ- 
ence between Vpgp and Vpbk. This signal drives the charge 
pump input voltage higher or lower. If Vppp leads Vpbk 
(Vrep > Vpbk) an UP pulse equal to the phase difference is 
produced that accordingly increases the VCO frequency. 
Similarly, if Vrpp trails Vpbk (Vref < Vpbk) a DOWN pulse 
is produced that decreases the VCO frequency. This pro- 
cess continues until the loop is locked which occurs when 
both the rising and trailing edges of Vppp and Vpbk ai's 
aligned and have the same frequency and phase (Vppp = 
Vfbk)- The phase error just described is also referred to as 
propagation delay when an output is used to drive the feed- 
back input. Due to the inherent characteristics and limita- 
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tions of a PLL, such as mismatches in internal edge rates 
and internal chip routing, during phase lock Vrep and Vpbk 
are aligned to within some non-zero value. This non-zero 
value, referred to as part to part skew, will have to be ac- 
counted for by the system designer when final timing re- 
quirements for a system are computed. 
Increasingly complex systems, especially those using high 
frequency multi-processors, require several clocks to be dis- 
tributed to different loads on one or more boards. Figure 2 
illustrates how PLLs can be used to achieve frequency mul- 
tiplication by placing a divider circuit in the feedback path. 
The PLL circuitry adjusts the frequency of Vrpp and Vpbk to 
be equal, so the VCO frequency must be 2x Vppp. Similar- 
ly, Figure 3 illustrates how using the feedback path increas- 
es fanout. The buffer placed in the feedback path allows for 
additional clock outputs all phase locked to Vppp. In each 
example, the PLL will adjust the output clock to ensure 
phase and frequency synchronization between Vrpp and 
VpBK- 

THE CHALLENGES OF CLOCK DISTRIBUTION 
Clock distribution is a significant design problem for high 
frequency systems and play a major role in the system de- 
sign. Synchronous systems, as shown in Figure 4, use a 
single clock source to coordinate the functions of several 
circuits on the board. This also means that the clocks are 
now distributed over greater distances as illustrated by 
traces 1 , 2 and 3 in Figure 4. The challenges for the system 
designer is to get the clock signals to the various destina- 
tions at very nearly the same time without sacrificing signal 
integrity or quality. Clock signals with square edges and zero 
rise and fall times are only found in text books. With real 
systems the low to high output edges do not happen at the 
same time. The difference in time between the earliest and 
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FIGURE 1. Basic PLL Block Diagram 
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FIGURE 2. Frequency Multiplication 



FIGURE 3. PLL Output Buffering 
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FIGURE 4. Clock Distribution on a Board 



the latest output transitions among all outputs is called out- 
put clock skew. Therefore, the designer must minimize out- 
put clock skew while maximizing clock integrity, this is not 
always possible because of intrinsic and extrinsic compo- 
nents that make up output clock skew. Intrinsic delay is at- 
tributed to differences in internal chip edge rates, internal 
chip routing and process variations. Extrinsic delay is attrib- 
uted to the trace length plus the loading at the end of those 
traces. The system designer has control over skew generat- 
ed due to these extrinsic components. 
In general, adding capacitance to a clock output increases 
its propagation delay time. A typical delay number could be 
1 .0 ns per 50 pF of loading. If therefore, one output had a 
25 pF load and a second a 50 pF load then the result would 
be about a 500 ps skew. Adding capacitance to an output 
would also result in an increase in delay due to slower out- 
put rise and fall times. Similarly, skews due to trace delays 
on printed circuit board can be as much as 150 to 200 ps 
per inch. Therefore two traces, one two inches long and the 
other three inches long will result in about a 200 ps skew. 
What can be done to address these limitations? The stan- 
dard approach used currently in most applications, has 
been to use matched trace lengths for clock signals to en- 
sure minimal skew due to different trace lengths. Layout of 
the shorter traces in a serpentine path or alternatively, use a 
delay element on the clock paths. In general, applications 
use one or a combination of these techniques but this has 
not always been easy to achieve because of limited board 
space and the fact that traces of different lengths behave 
differently and require special handling. Traces around five 
inches, depending on frequency and edge rate, behave like 
transmission lines and as such require proper termination to 
avoid reflections and clock signal degradation. 

PLL FEEDBACK DELAY NETWORKS 

As has already been described, PLLs, maintain the phase 
and frequency relationship between the input reference and 
the output clocks by externally hardwiring the feedback out- 
put (FDBKOUT) pin to the feedback input (FDBKIN) pin. The 
PLL circuitry matches the rising edges of the input reference 
and the output clocks. In essence an ideal zero delay buffer 
is realized, but in reality there are output loads and propaga- 
tion delays due to those loads. The goal here is to strive 
toward that ideal zero phase delay between the input refer- 
ence and the output pins. How is this achieved? 



There is no requirement in a PLL that the feedback connec- 
tion be hardwired from the FDBKOUT pin to the FDBKIN 
pin. By placing a delay in this feedback path, adjustments 
can be made to the FDBKOUT such that the information 
being presented to the FDBKIN is now delayed. The PLL 
circuitry now adjusts the outputs including the feedback out- 
put until lock is achieved. This delay in the feedback path 
can be customized by using an RC network and divider, see 
Figure 5. In other words by making this delay equal to the 
phase error between Vpgp and the worst case clock load, it 
is possible to zero out this phase error and align Vrep and 
VLoad- 
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GND 
Where: Cj = C|n -I- Ctr + CpH 
C|M is input capacitance 
CjR is trace capacitance 
CpH is additionai capacitance 

FIGURE 5. RC Tuning Networic 

National's CGS701 A in Figure 5 is used to illustrate how this 
technique was used to optimize for phase delay across 
loads and trace lengths. Before this technique can be used 
an understanding of the following issues are required: 

1 . The type of clock driver being used and its critical specifi- 
cations, e.g., output skew and prop delay. 

2. Determining the layout and loading of the clock signals on 
the board, e.g., short versus long traces. 



We will work through each of these steps using an actual 
clock distribution design example shown in Figure 8. It will 
be assumed that the final system timing requirements are 
known and the maximum allowable skew for error free oper- 
ation is also known. It will also be assumed that the data is 
sampled on the rising edge of clock. 
The CGS701 A is a low skew 1 to 8 PLL clock driver, with an 
external feedback path, providing outputs at 1 x, 2X, and 
4X the input. The output skew is less than 500 ps and 
input-to-output propagation delay of ±300 ps (including jit- 
ter). In addition, the CMOS output drive capability of 
±30 mA gives it an overall performance and flexibility that 
makes the CGS701 A an ideal clock source for this and oth- 
er high frequency design. 

The layout of the board and the loading on the outputs are 
shown in Figure 8. The trace length from the 1 to 8 driver to 
the CGS701A reference input is between 2 to 6 inches. In 
this particular application, the trace length from the 
CGS701A outputs (8) to each one of the loads has been 
kept to approximately four inches. There could be applica- 
tions where the trace lengths are not the same and this 
additional trace delay can be compensated for by either 
phase advancing the longer trace or retarding the shorter 
trace. Examples showing this and other configurations are 
shown in the results section of this application note. 
Variations in output loading increases the output rise and 
fall times therefore clock signal take longer to transition 
from a TTL high or low to the TTL threshold point of 1 .5V. 
An analysis of the CGS701 A showed that when the FDBKIN 
input was connected to the FDBKOUT output with a Yi inch 
trace, the delay between the Vref and FDBKIN was ap- 
proximately —400 ps when measured at the DUT. The mi- 
nus sign indicating FDBKIN signal leading Vr^p. The differ- 
ence in edge-rate between FDBKIN and Vref accounts for 
much of the -400 ps delay. 



Analyzing the RC delay network in the feedback path, it 
must be remembered that the phase frequency detector 
(PFD) of the PLL is positive edge triggered. The FDBKOUT 
signal that drives the PFD and the equations used to deter- 
mine the specific phase delay required is also based on the 
rising edge. Synchronising these two edges, Vref a"d 
FDBKOUT, and their reference points allows us to proceed 
with the selection of the components necessary for phase 
compensation. 

EQ1 R1 + [R1 + R2] X Vo = V| 

R1 and R2 are resistor values in fi 
Vo is the CGS701A output Vqh 
V| is the reference input swing 

EQ2 [RIXCj + R1 + XCj] + 

[R2 + [RtXCj + R1 + XCj]] X Vo = V| 

Equation 1 above represents a simplified version of a volt- 
age divider, the attenuation desired between V| and Vq de- 
termines the ratio of R1 and R2. In general the value of R2 
is in the range from 25fl to 250f!. Equation 2 shows a more 
complete version of the voltage divider. 
Assuming the following conditions: 

1 . If XGj is negligible, EQ2 simplifies and the simple voltage 
divider shown in EQ1 can be used. 

2. lfR1> >R2andR1> >XGt, the delay network shown in 
Figure 7 simplifies to an R2C7 network. 

This allows equation 3 shown below to be used to deter- 
mine the desired phase adjustment "dt". 

EQ3 Vt = V|[1 - £-t- R2C-I-] 

Vj is the PFD switching threshold 
V| is the reference input swing 
R2 resistors in li 
t phase shift required 
Cj capacitance in pF 
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FIGURE 6. Trace Length and Loading 
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RC DELAY NETWORK 




FIGURE 7 
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The phase adjustment process requires several iterations to 
get the final values of R1, R2 and Cj where Cj includes 
input trace, plus any additional capacitance required for 
phase shift. It is left up to the user to use either equations 1 
or 2 to determine these values. It is also recommended that 
the user validate the equations and assumptions used by 
verifying and fine tuning on the bench. In general compo- 
nents used must have tight tolerance (<1%) and prudent 
PCB layout techniques must be used to minimize sources of 
skew/phase error. 

PHASE OPTIMIZATION RESULTS 

The results from the phase optimization for the clock distri- 
bution design shown in Figure 6, using the delay network, is 
shown in this section. In addition, data is also shown here 
that can be used as a guideline by any system designer for 
any application using a PLL with an external feedback path. 
The phase optimization process is not complete until it is 
understood how various other parameters can affect the re- 
sults. We have attempted to provide data that will show how 
some of the following variables: Vqc. frequency, tempera- 
ture and Vrep edge-rate, different loads and load distances 
can affect the optimization process and phase delay. 
With reference to the application being considered here, the 
propagation delay requirements of the system designer was 
± 500 ps. Figure 5 was the customized delay circuit used for 
optimization and subsequently used by the system designer 
in his design. Table I shows the results of phase optimiza- 
tion under the following conditions: 

1 Inch Jumper 

Room Temperature 

Load @ 4 inches from CGS701 A output (see Figure 6 ) 

Tuning components, R1 = 50f!, R2 = 25f! 

TABLE I 



TABLE I 



Reference 
Frequency 


Vcc(V) 


Optimized 
Prop Delay 


25 MHz 


4.5 


±260ps 




5.5 


±210ps 


33 MHz 


4.5 


±265ps 




5.5 


±210ps 


40 MHz 


4.5 


±225ps 




5.5 


±205ps 



The effect over temperature (-40°C to -l-85°C) on phase 
shift can be anywhere from 50 ps-150 ps. This shift, as can 
be seen from Table I is about the same over the different 
input reference frequencies and supply voltages. 
Table II details the impact on phase delay by varying the 
input reference signal edge-rate, keeping its amplitude at 3V 
and feedback output rise time between 1.5 ns to 2.0 ns. 
Data is for 25°C, using the same loads as shown in Figure 8. 
With slower edge-rates the phase delay increases. In these 
examples, for an input reference with an edge rate of 2 ns a 
phase delay of 50 ps (zero delay) is realized. Conversely, for 
a 4 ns edge-rate the phase delay is considerable at 2.6 ns. If 
this were an actual edge-rate used in application then it 
would be possible to compensate for this phase delay using 
an RC network described earlier. 



Vref 
Rise Times 


Phase Delay 
Vcc = 5.0V 


2.0 ns 


50 ps 


4.0 ns 


1 .63 ns 


6.0 ns 


2.6 ns 



Table III below illustrates how phase delay is affected by the 
physical length of the feedback. The phase delay, for the 
loading described, is shown under two conditions: 1) with a 
straight jumper in the feedback path and 2) with an RC de- 
lay network in the feedback path. Comparing the results 
shows zero phase delay (70 ps) has been achieved using 
the RC delay network in the feedback path. Additional data, 
under the same conditions, shows what happens to phase 
delay when six and twelve inch feedback lengths are used. 

TABLE III 



Load: 200(1/100(1, see Figure 6 
Vcc = 5V, Freq = 40 MHz, Temp = 25°C 


Feedback 
Length 


Phase Delay 


Jumper 


R1 = 25andR2 = 50 


1 Inch 


210 ps 


70 ps 


6 Inch 


-400 ps 


-1.4ns 


12 Inch 


-1.7 ns 


-2.0 ns 



Table IV illustrates what happens to the phase delay when 
the distance of the loads from the clock output pins are 
different. The results, for a given 500f!/50 pF load, show a 
dramatic increase in phase delay from a load positioned 
three inches away and another twenty inches away. By se- 
lecting the correct delay components, adjustments can be 
made to zero out this large 1 .6 ns phase delay. 

TABLE IV 



Load: 500ri and 50 pF 
Vcc = 5V, Freq = 40 MHz, Temp = 25°C 


Feedback 
Length 


Load 
Distance 


Tuned 
Phase Delay 


2 Inch 


3 Inch 


-860 ps 


2 Inch 


7 Inch 


620 ps 


2 Inch 


20 Inch 


1.6 ns 



CONCLUSION 

This application note has discussed the high frequency 
challenges of clock distribution. We have shown how vary- 
ing trace lengths and loads at the end of clock traces can 
affect propagation delays. By using a real life system appli- 
cation, we have shown how using a customized delay ele- 
ment in the feedback path has allowed us to optimize prop- 
agation delays to within ±250 ps. 

This flexibility to optimize for low input-to-output phase delay 
in any application, gives PLLs a performance advantage 
that makes high frequency clock distribution applications 
very attractive for the system designer. 
For a copy of the CGS Design Databook (1995), please call 
our toll free hotline (800) 272-9959 and order Literature # 
400046. 
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LIFE SUPPORT POLICY 

NATIONAL'S PRODUCTS ARE NOT AUTHORIZED FOR USE AS CRITICAL COMPONENTS IN LIFE SUPPORT 
DEVICES OR SYSTEMS WITHOUT THE EXPRESS WRITTEN APPROVAL OF THE PRESIDENT OF NATIONAL 
SEMICONDUCTOR CORPORATION. As used herein; 

1. Life support devices or systems are devices or 2. A critical component is any component of a life 



systems which, (a) are intended for surgical implant 
into the body, or (b) support or sustain life, and whose 
failure to perform, when properly used in accordance 
with instructions for use provided in the labeling, can 
be reasonably expected to result in a significant injury 
to the user. 



support device or system whose failure to perform can 
be reasonably expected to cause the failure of the life 
support device or system, or to affect its safety or 
effectiveness. 
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